A method to obtain sustained data quality at Distimo
Abstract
This thesis attempts to answer the following research question: “How can we determine and improve data quality?” A method is proposed to systematically analyse the demands on, and the current state of, data quality within an organisation. The mission statement and information systems architecture are used to characterise the organisation. A list of data quality characteristics drawn from the literature is used to express the organisation in terms of data quality. Metrics are established to quantify these characteristics. A risk analysis then determines which areas are most important to improve. After improvement, the same metrics can be used to evaluate whether the changes succeeded. Distimo is an innovative application store analytics company that aims to solve the challenges created by a widely fragmented application store marketplace filled with equally fragmented information and statistics. Because Distimo’s products are heavily data-driven, data quality is very important to the company, and the method is applied to Distimo as a case study. The proposed method provides a way to determine the current state of data quality, to decide what to improve, and to evaluate whether the improvements deliver the desired outcome. The case study produced an in-depth analysis of Distimo, which in turn yielded a number of data quality improvements that are now in production and have improved data quality. Because of the generic nature of its input data, the proposed method is applicable to any organisation looking to improve data quality. The iterative improvement process allows fine-grained control over changes to organisational processes and systems.